Method for Generating Reliability Tests Based on Orthogonal Arrays and Field Data

ABSTRACT

A method for generating reliability tests for a telephone system is based upon sampling an orthogonal array which covers various combinations of test parameters. Field data is collected of actual telephone activity on a telephone system. The field data is evaluated so as to determine call-mix characteristics. Probabilistic weights for the different call-mix characteristics are obtained, and then the probabilistic weights are used to sample the test case scenarios generated in the orthogonal array which have the same call-mix characteristics. These test case scenarios are used to run tests on the telephone system. These tests are preferably performed using automated test scripts. After the test data is collected, reliability metrics are calculated from the test data.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a method for testing the reliability oftelephone systems, and more particularly to a reliability testing methodwhich is based upon orthogonal arrays and customer field data.

2. Description of the Relevant Art

Modern telephones are software controlled devices that are used in manydifferent environments, such as call centers, investment banking firms,law firms, etc. The same telephone device is typically not used in thesame way in different environments. In other words, an investment bankerdoes not use a telephone in the same way that an operator in a callcenter uses the same telephone. Moreover, each telephone can be in adifferent state depending upon a particular call being initiated orreceived. For example, a telephone can be involved in a point-to-point(“p2p”) call or a conference call, or it can be on hold, or it can betransferred from one caller to another, or it can be idle, etc.Additionally, the state of the telephone can vary depending upon whetherthe telephone is being used to initiate a call or whether it is beingused to receive a call. When all the possible states for an individualtelephone device are considered, it is possible that a telephone canliterally be placed in millions of different states.

Since a telephone can be placed in so many different states, it is verylikely that some of these states may cause the telephone to lock up orfail due to a software programming error. When a software controlledtelephone locks up, it can usually be reset by disconnecting thetelephone from the network, thereby removing electrical power from thetelephone. If the failure or lock up is caused by the telephone beingmoved into a particular state, it is important to discover which statecaused the failure and to correct the software.

In the prior art, reliability tests for software controlled telephonesystems are usually based upon random test generation (operationalprofiles), but these random tests do not guarantee that “good” or“effective” combinations of test parameters are chosen. Hence, the priorart reliability tests do not yield a fast solution that converges to amean-time-to-failure solution with a given confidence factor. It isknown in the prior art to use orthogonal arrays for testing, however,the orthogonal arrays of the prior art only provide all pair-wisecombinations of test parameters, which are typically used to replaceexhaustive system tests. For example, these orthogonal array techniquesmay be based on the so-called Taguchi method, which can reduce thenumber of combinations of test parameters. But the selection of the testparameters are typically arbitrary and do not reflect realistic systemusage in a particular environment. For example, financial institutionswhere there are many long conference calls versus call switch centerswhere there are many short calls that are transferred to other numbersare some examples of different operational environments for telephones.

An example of the Taguchi method is described in U.S. Pat. No.6,604,092, by Steward, entitled, “Expert system utilizing a knowledgebase and design of experiment (DOE) techniques”. Although the Taguchimethod is a powerful tool for testing software and other systems, theuse of the Taguchi method and orthogonal matrices alone cannot providean acceptable solution for the reliability testing of telephone systems,because a telephone has too many possible states that vary widely fromone environment to the next. There are simply too many combinations oftest parameters to effectively test the telephone for reliability ineach environment in which it is expected to be used.

SUMMARY OF THE INVENTION

In the present invention, two techniques are merged to assure aneffective combination of test parameters. By using an orthogonal matrixand field data, mean-time-to-failure and other reliability statisticscan be effectively computed.

The goal of the present invention, therefore, is to generate reliabilitytests for telephone systems using complex software implementations suchas telephone devices implemented using Session Initiation Protocol (SIP)for Voice-over-IP services, as described by Rosenberg, J., Schulzrinne,H., Camarillo, G., Johnston, A., Peterson, J., Sparks, R., Handley, M.and E. Schooler, “SIP: Session Initiation Protocol”, IETF RFC 3261, June2002, (hereinafter referred to as “Rosenberg”). Other complex softwareimplementations include H.323 protocol for packet-based multimediacommunication systems, as described in ITU-T Recommendation H.323,“Packet-based Multimedia Communication Systems,” 2006. The reliabilitytests of the present invention include a number of advantages over theprior art. The reliability tests of the present invention are effective,because they are designed such that they not only mimic/resemble thefield data collected from various sources, but they also stress thetelephones so that good reliability metrics can be obtained. Thereliability tests of the present invention are efficient, since theyconverge to well-known statistical metrics by making use of realisticdata collected from an operational environment. The reliability testsare reusable, since they can be applied to telephone systems by usingcommercially available test application tools (see for example TestComplete Tool developed by Automated QA, 6000. South Eastern AvenueSuite 14-M, Las Vegas, Nev. 89119, hereinafter referred to as “TestComplete Tool”). The reliability tests are repeatable and re-generatecertain failure scenarios where possible since most test systems wherethe reliability tests to be run will have error logging capabilities.Lastly, the reliability tests are remotely-executable, because they areexecutable from remote locations using tools such as PTI developed byAvaya and described in U.S. patent application Ser. No. 11/176,816,filed on Jul. 7, 2005, and entitled “Testing a data-processing systemwith telecommunications endpoints” or the Test Complete Tool fromAutomated QA.

The present invention utilizes an orthogonal array to gathercombinations of test case parameters when randomly choosing test casesbased on operational profiles/field data. At the heart of this is aprocess for generating a testing matrix which will be scheduledaccording to the operational profile of the product under test. This canbe extended to produce a tool which automates such testing and/or aservice by which one tests other companies' products or is used intesting a customer's end-to-end telephony system reliability. Finally,by choosing consistent operational data and testing matrices, multipleproducts can be compared side-by-side to get an “apples-to-apples”comparison of their reliability numbers (mean-time-to-failure,availability, etc).

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a high level architecture for a reliability testlaboratory associated with the present invention.

FIG. 2 illustrates a flow chart of the method of the present invention.

FIG. 3 illustrates a portion of an orthogonal matrix.

FIG. 4. illustrates the orthogonal matrix of FIG. 3 and relatedscenarios which are grouped together in buckets.

FIG. 5 illustrates a test selection engine which contains different calltypes.

DESCRIPTION OF PREFERRED EMBODIMENTS

The state space is virtually infinite for covering all possibleinteractions that can happen in a telephone implementing complexsoftware features (e.g., the telephones implementing SIP (See“Rosenberg”) or H.323 services, as described in ITU-T RecommendationH.323, “Packet-based Multimedia Communication Systems,” 2006.). One goodapproximation is using orthogonal arrays for testing all pair-wisecombinations of different inputs. It has been shown in the literaturethat this approach which includes the Taguchi method, which is describedin Phadke, M. S., “Quality Engineering Using Robust Design”, PrenticeHall, 1989, though not exhaustive, is preferable to randomly generatedtest cases since the orthogonal array approach guarantees that allpairwise combinations of inputs are covered by the test cases.

If the test cases of the present invention were solely based onorthogonal arrays, the test cases would lack the so-called “commonlyused scenarios” that users in the field tend to repeat more often thanother input sequences. To produce reliability metrics that reflect theuser behavior, the present invention, therefore, incorporates the fielddata statistics into the orthogonal array approach. This way, the testcases generated by the orthogonal array will be assigned probabilitiesbased on the frequency with which they are observed in the field.Advantageously, the resulting test cases will both satisfy theorthogonality principle and resemble the field data.

Referring now to FIG. 1, a high-level architecture of the reliabilitytest lab 100 used to practice the method of the present invention isshown. Preferably, the test lab 100 includes a server 101 such as aCisco 3640 server which is capable of supporting voice and dataapplications. The sever 101 is coupled to a switch 102 such as Avaya'sCajun 333R switch. The switch 102 is preferably coupled to a pluralityof stackable switches 103, 104 such as Avaya's P333T stackable switch.The stackable switches 103 are preferably coupled to first telephonebank 105 which includes a relatively large number of telephones.Likewise, the stackable switches 104 are preferably coupled to secondtelephone bank 106 which also includes a relatively large number oftelephones. The stackable switches 103 are also preferably coupled to amedia server and gateway 109 such as the Avaya S8700/G650. The telephonebanks 105, 106 are each respectively coupled to DHCP (“Dynamic HostConfiguration Protocol”) servers 107, 108.

In the reliability test lab 100, the number of telephones that should beused is in the order of dozens. The higher the number of telephones usedin the lab, the faster the reliability statistics can be measured.Typically, fifty to one hundred telephones are used in the telephoneindustry due to space and maintenance constraints to implement each ofthe telephone banks of 105 and 106 of FIG. 1. The software tools to beutilized preferably include a test script development phase using toolssuch as PTI (See U.S. patent application Ser. No. 11/176,816) fortelephone screen verification tools implementing Chinese postman toursto generate minimum-cost test sequences as described by Fecko M. A.,Uyar M. U., Amer P. D., Sethi A. S., Dzik T., Menell R., McMahon M., “ASuccess Story of Formal Description Techniques: Estelle Specificationand Test Generation for MIL-STD 188-220,” Computer Communications,Special Issue on Formal Description Techniques in Practice, 23, pp.1196-1213, 2000. This technique utilizes an orthogonal array generatorto generate the test case selection template, and data analysis tools toobtain statistical data from the field (e.g., in a given week,percentage of calls that were point-to-point, conference, transfer,etc.).

Referring now to FIG. 2, a flow chart illustrates the reliabilitytesting method of the present invention. The reliability testing methodincludes seven steps.

In step S1, the reliability test parameters are identified, and thesetest parameters could include call type, duration, registration periodor frequency, postman scenario number, etc. A registration period isoften required by service providers to prevent the use of stolentelephones. Each telephone has to report its user name and passwordperiodically to a registrar switch. This period can vary from minutes todays depending on the registrar operational options. In addition, basedupon the field data, different parameters such as the length of a callbeing put on hold (for example, parameters called long_hold andshort_hold to represent calls placed on hold for a long or short time,respectively) can be identified.

In step S2 an orthogonal array is generated using the Taguchi method tocover all pair-wise combinations of the chosen test parameters at stepS1. The size of the orthogonal array is determined from the number offactors to be studied, the number of levels for each factor, etc. andselecting the appropriate sized array from published tables such asthose found in Chapters 3 and 7 of Phadke, M. S., “Quality EngineeringUsing Robust Design”, Prentice Hall, 1989. A partial representativeexample of an orthogonal array is illustrated in FIG. 3.

The example orthogonal array of FIG. 3 includes four columns and only aportion of the rows and columns which are applicable to each test case.In the first column the test cases are identified by a number, and inFIG. 3 only test cases 128 through 137 are illustrated. In the test lab,the test cases are executed in this sequential order. In the secondcolumn, a maximum of five different scenarios are identified. Adescription of a representative scenario is provided below. In the thirdcolumn the type of connection is identified. In the example of FIG. 3there are five different types of calls are identified: p2p (i.e.,point-to-point calls where person A calls person B), conference (wherethree persons A, B and C establish a 3-way conference call amongthemselves), hold (where person A calls person B, and one of them putthe call on hold), transfer (where person A calls B and transfers thecall to person C), and idle (where the telephone is left idle withoutany incoming or outgoing calls). In the fourth column, differentdurations for the call types are provided. In addition to the threeinputs in the second, third and fourth columns, other inputs, such asregistration frequency, encryption type, transfer type, auto holdfeature, etc., could be added to the orthogonal array. The number ofinputs to be included in the calculation of an orthogonal array, and thetypes of inputs of the orthogonal array including scenarios, call types,and durations are determined by the test designer.

A representative description of the point-to-point call (i.e., p2p)scenarios for the orthogonal array of FIG. 3 will now be provided. Inthis example, there are three scenarios for p2p call types.

-   -   In scenario 1, person A calls person B; B answers; A and B talk        and B hangs up (i.e., B disconnects the call).    -   In scenario 2, A calls B, before B answers A hangs up.    -   In scenario 3, B calls A; A answers, they talk and A hangs up.        It can be appreciated that many other scenarios for other call        types can be generated from the field data.

In step S3 the field data is collected from a desired operationalenvironment. The desired environment may be a call center, an investmentbank, or a law firm, etc., and accordingly the field data will vary fromone desired environment from another. For example, in a call center theduration of the calls may be relatively short and there may be numeroustransfers among the employees of the call center. A call centeroperator, for example, may answer the call and then quickly transfer thecaller to, for example, a sales or service representative. Whereas, in alaw firm, for example, there may be frequent and relatively longconference calls with relatively few transfers.

The orthogonal array generated test cases in step S2 can be envisionedsuch that they are grouped together in different “buckets” asillustrated in FIG. 4. In this illustration, the same call types will beplaced together in the same bucket. The number of tests cases in eachbucket will depend on the orthogonal array output size. For example, ifthere are six hundred (600) steps (or rows) in the orthogonal arrayoutput, there may be one hundred fifty (150) p2p type of steps, twohundred (200) hold type of steps, one hundred twenty (120) conferencetype of steps, etc. In the illustrated portion of the orthogonal arrayof FIG. 3, the one hold scenario (i.e., test case 132) is placed in thebucket for hold scenarios, the one conference scenario (test case 131)is placed in the bucket for conference scenarios, the three p2pscenarios (test cases 128, 130, 133, 135) are placed in the bucket forp2p scenarios, the one transfer scenario (test case 136) is placed inthe bucket for transfer scenarios, and three idle scenarios (test cases129, 134, 137) are placed in the bucket for idle scenarios.

In step S4, the percentage of test cases grouped by call type aredetermined from the field data in order to obtain call-mixcharacteristics from the field data. For example, in a desiredenvironment at any given time a first percentage of the telephones maybe idle, a second percentage may be involved in p2p calls, a thirdpercentage may be involved in a conference call, a fourth percentage maybe involved in a transfer, and a fifth percentage may be on hold. Asillustrated in FIG. 5, each test case may be figuratively represented bya ball and placed in a selection test engine which is represented by theoval 500. It should be noted that, in this representation, the testselection engine 500 is analogous to a rotating lottery bin withnumbered balls. From FIG. 5, it can be readily appreciated that 30% ofthe test cases involve idle telephones, 40% involve p2p calls, 10%involve holds, 10% involve transfers and 10% involve conference calls.

In step S5, probabilistic weights for the different reliability testparameters are obtained from the call-mix data of step S4. Using theseprobabilistic weights, existing automated test scripts 201-205 fordifferent types of calls are randomly chosen and run. These existingautomated test scripts 201-205 can be run for the following differenttypes of calls: hold, transfer, conference, idle and p2p.

Referring again to FIG. 5, in reality, the test selection engine 500 isa suitably programmed digital computer that randomly selects the calltype balls within each group in the test selection engine. Based uponthe ball type, a test case generated from the orthogonal array israndomly selected from one of the five buckets of scenarios of FIG. 4.Once the ball is selected and the test case is performed in accordancewith the appropriate test scripts 201-205, the ball is returned to thetest selection engine 500. If the above described experiments are runlong enough, eventually all the call types or balls from the testselection engine 500 with percentages close to the field data will havebeen performed. Moreover, substantially all of the orthogonal arrayscenarios from the scenario buckets of FIG. 4 will have been selected.

In step S6, failure data is collected from test log files generated inthe performance of step S5. In other words, the actual failure dataassociated with the operation of telephone banks 105, 106 in performanceof the experiments of step S5 are collected from the reliability testlab 100.

In step S7, different reliability metrics are calculated using thefailure data collected in step S6. These metrics, or reliability testresults, are aimed at answering important marketing concerns. A sampleof such marketing issues include: (i) the probability that a telephonewill fail after X weeks, (ii) the probability that a telephone willsurvive after X weeks, (iii) the probability that a telephone willoperate without failure for x weeks, (iv) of the telephones that remainfailure-free after first x weeks of operation, the probability that theywill operate without failure for y weeks. (v) the mean time to failure(“MTTF”), (vi) the required duration of the tests until MTTF can becalculated with a confidence factor of 0.999999, and (vii) for a givennumber of telephones, the required duration of tests to calculate MTTFwith a confidence factor of 0.999999. The techniques for calculatingthis type of data are well known in the art, and are described in textbooks such as Bevington, Philip R. “Data Reduction and Error Analysisfor the Physical Sciences”, McGraw-Hill Book Co., 1969, and Larson,Harold J., “Introduction to Probability Theory and StatisticalInference”, John Wiley & Sons, 1982.

The invention has been described with reference to exemplaryembodiments. However, it will be readily apparent to those skilled inthe art that it is possible to embody the invention in specific formsother than those of the embodiments described above. This may be donewithout departing from the sprit of the invention. The exemplaryembodiments are merely illustrative and should not be consideredrestrictive in any way. The scope of the invention is given by theappended claims, rather than the preceding description, and allvariations and equivalents which fall within the range of the claims areintended to be embraced therein.

1. A method for generating reliability tests for telephone systems,comprising: generating an orthogonal array which includes test caseshaving combinations of test parameters; obtaining call characteristicsfrom field data based upon actual telephone usage; obtainingprobabilistic weights for different call characteristics based upon afrequency of occurrence in the field data; using the probabilisticweights to select desirable test cases from the orthogonal array;running tests on the telephone system using the selected desirable testcases; collecting test data from the tests that have been run; andcalculating reliability metrics for the telephone system based on thetest data.
 2. The method according to claim 1 wherein the tests are runusing automated test scripts.
 3. The method according to claim 2 whereinthe orthogonal array is generated using the Taguchi method.
 4. Themethod according to claim 3 wherein the orthogonal array includes testscases which include at least call types, scenarios, and durations. 5.The method according to claim 4 wherein the call types are selected fromthe group including a transfer, point-to-point, hold, conference, andothers.
 6. The method according to claim 5 wherein the test scripts arerun for the different call types.
 7. The method according to claim 1wherein the reliability metrics are selected from the group including atleast a mean time to failure, a mean time to failure with apredetermined confidence factor, the probability that a telephone willfail after a certain time period, and the probability that a telephonewill survive for a certain time period.
 8. The method according to claim4 wherein the test cases of the orthogonal array are divided into groupsby common characteristics and wherein a test case is randomly selectedfrom one of the groups once the group is selected based upon itsprobabilistic weight as determined from the field data.
 9. A computerprogram on a computer readable medium that generates reliability testsfor telephone systems, comprising: generating an orthogonal array whichincludes test cases having combinations of test parameters; obtainingcall characteristics from field data based upon actual telephone usage;obtaining probabilistic weights for different call characteristics basedupon a frequency of occurrence in the field data; using theprobabilistic weights to select desirable test cases from the orthogonalarray; running tests on the telephone system using the selecteddesirable test cases; collecting test data from the tests that have beenrun; and calculating reliability metrics for the telephone system basedon the test data.
 10. The computer program according to claim 9 whereinthe tests are run using automated test scripts.
 11. The computer programaccording to claim 10 wherein the orthogonal array is generated usingthe Taguchi method.
 12. The computer program according to claim 11wherein the orthogonal array includes tests cases which include at leastcall types, scenarios, and durations.
 13. The computer program accordingto claim 12 wherein the call types are selected from the group includinga transfer, p2p, hold, and conference.
 14. The computer programaccording to claim 13 wherein the test scripts are run for the differentcall types.
 15. The computer program according to claim 9 wherein thereliability metrics are selected from the group including at least amean time to failure, a mean time to failure with a predeterminedconfidence factor, the probability that a telephone will fail after acertain time period, and the probability that a telephone will survivefor a certain time period.
 16. The computer program according to claim12 wherein the test cases of the orthogonal array are divided intogroups by common characteristics and wherein a test case is randomlyselected from one of the groups once the group is selected based uponits probabilistic weight as determined from the field data.
 17. A methodfor generating reliability tests for communication systems, comprising:generating an orthogonal array which includes test cases havingcombinations of test parameters; obtaining operational characteristicsfrom field data based upon actual communication usage; obtainingprobabilistic weights for different operational characteristics basedupon a frequency of occurrence in the field data; using theprobabilistic weights to select desirable test cases from the orthogonalarray; running tests on the communication system using the selecteddesirable test cases; collecting test data from the tests that have beenrun; and calculating reliability metrics for the communication systembased on the test data.
 18. The method according to claim 17 wherein thetests are run using automated test scripts.
 19. The method according toclaim 18 wherein the orthogonal array is generated using the Taguchimethod.
 20. The method according to claim 19 wherein the orthogonalarray includes tests cases which include at least operational types,scenarios, and durations.
 21. The method according to claim 20 whereinthe test scripts are run for the different operational types.
 22. Themethod according to claim 17 wherein the reliability metrics areselected from the group including at least a mean time to failure, amean time to failure with a predetermined confidence factor, theprobability that a communication will fail after a certain time period,and the probability that a communication device will survive for acertain time period.
 23. The method according to claim 20 wherein thetest cases of the orthogonal array are divided into groups by commoncharacteristics and wherein a test case is randomly selected from one ofthe groups once the group is selected based upon its probabilisticweight as determined from the field data.
 24. A computer program on acomputer readable medium that generates reliability tests forcommunication systems, comprising: generating an orthogonal array whichincludes test cases having combinations of test parameters; obtainingoperational characteristics from field data based upon actualcommunication usage; obtaining probabilistic weights for differentoperational characteristics based upon a frequency of occurrence in thefield data; using the probabilistic weights to select desirable testcases from the orthogonal array; running tests on the communicationsystem using the selected desirable test cases; collecting test datafrom the tests that have been run; and calculating reliability metricsfor the communication system based on the test data.
 25. The computerprogram according to claim 24 wherein the tests are run using automatedtest scripts.
 26. The computer program according to claim 25 wherein theorthogonal array is generated using the Taguchi method.
 27. The computerprogram according to claim 26 wherein the orthogonal array includestests cases which include at least operational types, scenarios, anddurations.
 28. The computer program according to claim 27 wherein thetest scripts are run for the different operational types.
 29. Thecomputer program according to claim 26 wherein the reliability metricsare selected from the group including at least a mean time to failure, amean time to failure with a predetermined confidence factor, theprobability that a communication device will fail after a certain timeperiod, and the probability that a communication device will survive fora certain time period.
 30. The computer program according to claim 27wherein the test cases of the orthogonal array are divided into groupsby common characteristics and wherein a test case is randomly selectedfrom one of the groups once the group is selected based upon itsprobabilistic weight as determined from the field data.